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Abstract. Probabilistic Biichi Automata (PBA) are randomized, finite state automata 
that process input strings of infinite length. Based on the threshold chosen for the accep- 
tance probability, different classes of languages can be defined. In this paper, we present a 
number of results that clarify the power of such machines and properties of the languages 
they define. The broad themes we focus on are as follows. We present results on the 
decidability and precise complexity of the emptiness, universality and language contain- 
ment problems for such machines, thus answering questions central to the use of these 
models in formal verification. Next, we characterize the languages recognized by PBAs 
topologically, demonstrating that though general PBAs can recognize languages that are 
not regular, topologically the languages are as simple as oj-regular languages. Finally, we 
introduce Hierarchical PBAs, which are syntactically restricted forms of PBAs that are 
tractable and capture exactly the class of tj-regular languages. 



Automata on infinite (length) strings have played a central role in the specification, mod- 
eling and verification of non-terminating, reactive and concurrent systems |V\V<S(il |Kur94l 




IVWS83I IHP961 15is83j . However, there are classes of systems whose behavior is probabilistic 
in nature; the probabilistic behavior being either due to the employment of randomization 
in the algorithms executed by the system or due to other uncertainties in the system, 
such as failures, that are modeled probabilistically. While Markov Chains and Markov 
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Decision Processes have been used to model such behavior in the formal verification com- 
munity [RKNP04J, both these models do not adequately capture open, reactive probabilistic 
systems that continuously accept inputs from an environment. The most appropriate model 
for such systems are probabilistic automata on infinite strings, which are the focus of study 
in this paper. 

Probabilistic Biichi Automata (PBA) have been introduced in [BG05| to capture such 
computational devices. These automata generalize probabilistic finite automata (PFA) 
[Rab63| Sal73j lPaz71| from finite length inputs to infinite length inputs. Informally, PBAs 
are like finite-state automata except that they differ in two respects. First, from each 
state and on each input symbol, the PBA may roll a dice to determine the next state. 
Second, the notion of acceptance is different because PBAs are probabilistic in nature and 
have infinite length input strings. The behavior of a PBA on a given infinite input string 
can be captured by an infinite Markov chain that defines a probability measure on the 
space of runs/executions of the machine on the given input. Like Biichi automata, a run 
is considered to be accepting if some accepting state occurs infinitely often, and therefore, 
the probability of acceptance of the input is defined to be the measure of all accepting runs 
on the given input. There are two possible languages that one can associate with a PBA 
B [BG05, BBG08J - £>o(i?) (called probable semantics) consisting of all strings whose 
probability of acceptance is non-zero, and C=i(B) (called almost sure semantics) consisting 
of all strings whose probability of acceptance is 1. Based on these two languages, one can 
define two classes of languages - L(PBA >0 ), and L(PBA =1 ) which are the collection of 
all languages (of infinite length strings) that can be accepted by some PBA with respect 
to probable, and almost sure semantics, respectively. In this paper we study the expressive 
power of, and decision problems for these classes of languages. 

We present a number of new results that highlight three broad themes. First, we 
establish results on decidability and precise complexity of the canonical decision problems 
in verification, namely, emptiness, universality, and language containment, for the classes 
L(PBA >0 ) and L(PBA =1 ). For the decision problems, we focus our attention on RatPBAs 
which are PBAs in which all transition probabilities are rational. For RatPBAs B and £>', 
our results are as follows. 

(A) Checking if C =1 (B) = and C =1 (B) = S* are PSPACE-complete. 

(B) The problems of checking if C > o(B) = 0, and if £>o(i?) = X* are SP>-complete. 

(C) The problems of checking if C =1 (B) C C =1 (B') and if £ >0 (B) C C >Q (B') are Un- 
complete. 

The decidability of the universality checking of C=\(B) (bullet (A) above) is a new result. 
The result establishing the PSPACE-completeness of emptiness checking of C=\(B) (bullet 
(A) above) substantially improves the result of [BBG08; where it was shown to be decidable 
in EXPTIME and conjectured to be EXPTIME-hard. The improved upper bound for 
emptiness checking is established by observing that the complement of the language C=\{B) 
is recognized by a special PBA Ad (with probable semantics) called a finite state probabilistic 
monitor (FPM) [CSV081 [CSV09a] and then exploiting a result in [CSV09a| that shows that 
the language of an FPM is universal if and only if there is an ultimately periodic word in 
the complement of the language recognized by a FPM. This observation of the existence of 
ultimately periodic words does not carry over to the class L(PBA >0 ). However, we show that 
£>o($) is non-empty iff it contains a strongly asymptotic word, which is a generalization of 
ultimately periodic word. This allows us to show that the emptiness problem for L(PBA >0 ), 
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though undecidable as originally shown in [BBG08], is S^-complete (bullet (B) above), 
where El] is a set in the second level of the arithmetic hierarchy. This result is noteworthy 
because typically problems of automata on infinite words that are undecidable tend to 
lie way beyond the arithmetic hierarchy in the analytical hierarchy. Finally, given that 
the emptiness and universality problems for L(PBA =1 ) are in PSPACE (bullet (A)), one 
would expect language containment under almost sure semantics to be at least decidable. 
However, surprisingly, we show that it is, in fact, S^-complete (bullet (C) above). 

The second theme brings to sharper focus the correspondence between nondeterminism 
and probable semantics, and between determinism and almost sure semantics, in the con- 
text of automata on infinite words. This correspondence was hinted at in [BBG08]. There 
it was observed that L(PBA =1 ) is a strict subset of L(PBA >0 ) and that while Biichi, Rabin 
and Streett acceptance conditions all yield the same class of languages under the probable 
semantics, they yield different classes of languages under the almost sure semantics. These 
observations mirror the situation in non-probabilistic automata — languages recognized 
by deterministic Biichi automata are a strict subset of the class of languages recognized 
by nondeterministic Biichi automata, and while Biichi, Rabin and Streett acceptances are 
equivalent for nondeterministic machines, Biichi acceptance is strictly weaker than Rabin 
and Streett for deterministic machines. In this paper we further strengthen this corre- 
spondence through a number of results on the closure properties as well as the topological 
structure of L(PBA >0 ) and L(PBA =1 ). 

First we consider closure properties. It was shown in [BBG08J that the class L(PBA >0 ) 
is closed under all the Boolean operations (like the class of languages recognized by nonde- 
terministic Biichi automata) and that L(PBA =1 ) is not closed under complementation. We 
extend these observations as follows. 

(D) L(PBA =1 ) is closed under intersection and union. 

(E) Every language in L(PBA >0 ) can be expressed as the Boolean combination of languages 
in L(PBA =1 ). 

These results mimic similar observations about Biichi automata — the class of languages 
recognized by deterministic Biichi automata is closed under union and intersection, but not 
complementation; and, any w-regular language (or languages recognized by nondeterministic 
Biichi machines) can be expressed as the Boolean combination of languages recognized by 
deterministic Biichi automata. 

Next, we characterize the classes topologically. There is a natural topological space 
on infinite length strings called the Cantor topology |Tho90| . We show that, like w-regular 
languages, all the classes of languages defined by PBAs lie in very low levels of this Borel 
hierarchy. We show that- 

(F) L(PBA =1 ) is strictly contained in Q$, just like the class of languages recognized by 
deterministic Biichi is strictly contained in Q$. 

(G) L(PBA >0 ) is strictly contained in the Boolean closure of Gs rnuch like the case for 
w-regular languages. 

The last theme identifies syntactic restrictions on PBAs that captures regularity. Much like 
PFAs for finite word languages, PBAs, though finite state, allow one to recognize non-regular 
languages. It has been shown [BG051 IBBG08] that both L(PBA >0 ) and L(PBA =1 ) contain 
non-w-regular languages. A question initiated in [BG05] was to identify restrictions on PBAs 
that ensure that PBAs have the same expressive power as finite-state (non-probabilistic) 
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machines. One such restriction was identified in |BG05| . where it was shown that uni- 
form PBAs with respect to the probable semantics capture exactly the class of w-regular 
languages. However, the uniformity condition identified by Baier et. al. was semantic in 
nature. In this paper, we identify one simple syntactic restriction (i.e., one that is based 
only on the local transition structure of the machine, and can be efficiently checked) that 
captures regularity both for probable semantics and almost sure semantics. Not only, the 
restricted PBAs capture the notion of regularity, they are also very tractable. 

The restriction we consider is that of a hierarchical structure. A Hierarchical PBA 
(TiPBAj is a PBA whose states are partitioned into different levels such that, from any 
state q, on an input symbol a, at most one transition with non-zero probability goes to a 
state at the same level as q and all others go to states at higher level. We show that - 
(H) HPBAs with respect to probable semantics define exactly the class of w-regular lan- 
guages. 

(I) HPBAs with respect to almost sure semantics define exactly the class of w-regular 

languages in L(PBA =1 ), namely, those recognized by deterministic Biichi automata. 
(J) Emptiness and universality problems for probable semantics for HPBAs with rational 

transition probabilities are NL-complete and PSPACE-complete, respectively. 
(K) Emptiness and universality problems for almost sure semantics for HPBAs with rational 

transition probabilities PSPACE-complete and NL-complete, respectively. 
The complexity of decision problems for HPBAs under probable semantics is interesting 
because this is the exact same complexity as that for (non-probabilistic) Biichi automata. 
In contrast, the emptiness problem for uniform PBA has been shown to be in EXPTIME 
and co-NP-hard |BG05| : thus, they seem to be less tractable than HPBA. 

The rest of the paper is organized as follows. After discussing closely related work, 
we start with some preliminaries (in Section [2]) before introducing PBAs. We present our 
results about the probable semantics in Section [3l and almost sure semantics in Section 
Hierarchical PBAs are introduced in Section [5l and conclusions are presented in Section [6j 

Related Work. Probabilistic Biichi automata (PBA), introduced in |BG05j . gener- 
alize the model of Probabilistic Finite Automata [Rab63 ; Sal73 j IFaz71j to consider inputs 
of infinite length. In [BG05], Baier and Grofier only considered the probable semantics for 
PBA. They also introduced the model of uniform PBAs to capture w-regular languages and 
showed that the emptiness problem for such machines is in EXPTIME and co-NP-hard. 
The almost sure semantics for PBA was first considered in [BBG08 where a number of 
results were established. It was shown that L(PBA >0 ) are closed under all Boolean opera- 
tions, L(PBA =1 ) is strictly contained in L(PBA >0 ), the emptiness problem for L(PBA >0 ) 
is undecidable, and the emptiness problem of L(PBA =1 ) is in EXPTIME. We extend and 
sharpen the results of this paper. In a series of previous papers [CSVOH [CSV09aJ, we con- 
sidered a special class of PBAs called FPMs (Finite state Probabilistic Monitors) whose 
accepting set of states consists of all states excepting a rejecting state which is also absorb- 
ing. There we proved a number of results on the expressiveness and decidability /complexity 
of problems for FPMs. We draw on many of these observations to establish new results for 
the more general model of PBAs. 

An extended abstract of this paper appeared in [CSV09b]. Several proofs were omitted 
in [CSV09b] for lack of space, and the current version includes all of these proofs. 
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2. Preliminaries 

The set of natural numbers will be denoted by N, the closed unit interval by [0, 1] and the 
open unit interval by (0, 1). The power-set of a set X will be denoted by 2 X . 

Sequences. Given a finite set S, \S\ denotes the cardinality of S. Given a sequence (finite 
or infinite) k = sq,si,... over S, \k\ will denote the length of the sequence (for infinite 
sequence |«| will be uj), and «[£] will denote the ith element s, of the sequence. As usual 
S* will denote the set of all finite sequences/strings/words over S, S + will denote the set 
of all finite non-empty sequences/strings/ words over S and S u will denote the set of all 
infinite sequences/strings/words over S. Given ry E S* and k £ S* U S w , tjk is the sequence 
obtained by concatenating the two sequences in order. Given l_i C S* and l_2 Q the 
set l_il_2 is defined to be {r]K | rj G l_i and k € L2}. Given natural numbers i,j < \k\, n[i : j] 
is the finite sequence Sj, . . .Sj, where Sk = n[k}. The set of finite prefixes of k is the set 
Pref( K ) = {K[0,j]\j €N,j <\k\}. 

Arithmetical Hierarchy. Let V be a finite alphabet. A language L over T is a set of 
finite strings over T. Arithmetical hierarchy consists of classes of languages 11° for 
each integer n > 0. Fix an n > 0. A language L € S° iff there exists a recursive predicate 
(j>(u,xi, ■~,x n ) where u is a variable ranging over T*, and for each i,0 < i < n, Xi is a finite 
sequence of variables ranging over integers such that 

L = {« £ T 3fi, Vf 2 , • • • , QnX n <f>(u,X*l, ...,X n )} 

where Q n is an existential quantifier if n is odd, else it is a universal quantifier. Note that the 
quantifiers in the above equation are alternating starting with an existential quantifier. The 
class 11° is exactly the class of languages that are complements of languages in S2 . ^-i 
are exactly the class of R.E.-sets and co-R.E.-sets. A canonical E5" con rpl e t; e □ language 
is the set of deterministic Turing machine encodings that halt on some input string. A well 
known 5]2~ com Pl e t e language is the set of deterministic Turing machine encodings that halt 
on finitely many inputs. 

Languages of infinite words. A language L of infinite words over a finite alphabet £ 
is a subset of (Please note we restrict only to finite alphabets). A set of languages of 
infinite words over £ is said to be a class of languages of infinite words over S. Given a 
class £, the Boolean closure of C, denoted BCI(£), is the smallest class containing C that 
is closed under the Boolean operations of complementation, union and intersection. 

Automata and w-regular Languages. A finite automaton on infinite words, A, over a 
(finite) alphabet S is a tuple (Q, qo, F, A), where Q is a finite set of states, ACQxExQis 
the transition relation, go £ Q is the initial state, and F defines the accepting condition. The 
nature of F depends on the type of automaton we are considering; for a Biichi automaton 
F Q Q, while for a Rabin automaton F is a finite subset of 2^ x 2®. If for every q € Q 
and a € X, there is exactly one q' such that (q,a,q') € A then A is called a deterministic 
automaton. Let a = ao, a%, . . . be an infinite string over S. A run r of A on a is an infinite 
sequence sq,s\, . . . over Q such that sq = qo and for every i > 0, (sj,aj,Sj+i) € A. The 
notion of an accepting run depends on the type of automaton we consider. For a Biichi 
automaton, r is accepting if some state in F appears infinitely often in r. On the other hand 
for a Rabin automaton, r is accepting if it satisfies the Rabin acceptance condition — there 



Let C be a class in the arithmetic hierarchy. L £ C is said to be C-complete if L 6 C, and for every 
L' £ C there is a computable function / such that x £ L' iff f(x) £ L. 
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Figure 1: The Borel Hierarchy. Inclusions from left to right are strict. 

is some pair (Bi, G{) £ F such that all the states in Bi appear only finitely many times in r, 
while at least one state in G{ appears infinitely many times. The automaton A accepts the 
string a if it has an accepting run on a. The language accepted (recognized) by A, denoted 
by C(A), is the set of strings that A accepts. A language L C is called co-regular iff there 
is some Biichi automata A such that C(A) = L In this paper, given a fixed alphabet S, we 
will denote the class of w-regular languages by Regular. It is well-known that unlike the case 
of finite automata on finite strings, deterministic Biichi automata are less powerful than 
nondeterministic Biichi automata. On the other hand, nondeterministic Rabin automata 
and deterministic Rabin automata have the expressive power and they recognize exactly 
the class Regular. Finally, we will sometimes find it convenient to consider automata A that 
do not have finitely many states. We will say that a language L is deterministic iff it can be 
accepted by a deterministic Biichi automaton that does not necessarily have finitely many 
states. We denote by Deterministic the collection of all deterministic languages. Please 
note that the class Deterministic strictly contains the class of languages recognized by finite 
state deterministic Biichi automata. The following are well-known results [ PP04 , T_ho90j . 

Proposition 2.1. L € Regular n Deterministic iff there is a finite state deterministic Biichi 
automaton A such that C(A) = L. Furthermore, Regular n Deterministic C Regular and 
Regular = BCI(Regular n Deterministic). 

Topology on infinite strings. The set E w comes equipped with a natural topology called 
the Cantor topology. The collection of open sets is the collection Q = {LS^ | L C 
The collection of closed sets, T, is the collection of prefix-closed sets — L is prefix-closed if 
for every infinite string a, if every prefix of a is a prefix of some string in L, then a itself 
is in L. In the context of verification of reactive systems, closed sets are also called safety 
languages |Lam85t [AS85J . 

Borel Hierarchy on the Cantor space. For a class C of languages, we define Cg = 
{nje^Lj | Lj £ £} and C a = {Uj G ^Lj | L« £ £}. The set of open sets of the Cantor space 
is closed under arbitrary unions but only finite intersections. Similarly the set of closed 
sets of the Cantor union is closed arbitrary intersections but only finite unions. The Borel 
hierarchy of the Cantor space is obtained by the means of countable unions, intersections 
and complementation, and is shown in Figure Q3 This yields a transfinite hierarchy, but we 
will restrict our attention to the first few levels. At the lowest level of this hierarchy is the 
collection Q n J- which is strictly contained in both Q and T which form the next level of 
the hierarchy. Both Q and T are strictly contained in the collection Qg n J- a which forms 
the next level. The collection QgC\T a is strictly contained in Qg and T a which is at the next 

2 This topology is also generated by the metric d : E w x E" — > [0, 1] where d(a, (3) is iff a = /3; otherwise 
it is ~ where i is the smallest integer such that a[i] ^ /3[i]. 
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level. Qg and J~ a are strictly contained in Qg a H J- a g which itself is strictly contained in both 
Qg a and J- a g- One remarkable result in automata theory is that the class of languages Qg 
coincides exactly with the class of languages recognized by infinite-state deterministic Biichi 
automata [Lan69[ IPP044 ITho90| . This combined with the fact that the class of w-regular 
languages is the Boolean closure of w-regular deterministic Biichi automata yields that the 
class of w-regular languages is strictly contained in BC\(Qg) which itself is strictly contained 
mg 5a nT a5 [PP04llTho90| . 

Proposition 2.2. Qg = Deterministic, and Regular C BC\(Qg) C Qg a n F a $. 

2.1. Probabilistic Biichi automata. We shall now recall the definition of probabilistic 
Biichi automata given in [BG05]. Informally, PBAs are like finite-state deterministic Biichi 
automata except that the transition function from a state on a given input is described as a 
probability distribution that determines the probability of the next state. PBAs generalize 
the probabilistic finite automata (PFA) [Rab63[ ISal73[ IPaz71| on finite input strings to 
infinite input strings. Formally, 

Definition 2.3. A finite state probabilistic Biichi automata (PBAJ over a finite alphabet 
£ is a tuple B = (Q,q s ,Qf,5) where Q is a finite set of states, q s G Q is the initial state, 
Qf Q Q is the set of accepting /final states, and 5 : Q x £ x Q — > [0,1] is the transition 
relation such that for all q G Q and a G S, Ylq'eQ ^(Qj a > l') = 1- ^ n addition, if 5(q, a, q') is 
a rational number for all q,q' G Q,a G S, then we say that A 7 ! is a rational probabilistic 
Biichi automata (RatPBA). 

Notation: The transition function 5 of PBA B on input a can be seen as a square matrix 
5 a of order \Q\ with the rows labeled by "current" state, columns labeled by "next state" 
and the entry S a (q,q') equal to 5{q,a,q'). Given a word u = a$ai ...a n £ S + , 5 U is the 
matrix product S ao S ai . . . 5 an . For an empty word e S S* we take 5 e to be the identity matrix. 
Finally for any Qq C Q, we define 5 u (q, Qq) = Yl q '£Q 3u(q,q')- Given a state q G Q and a 
word u G S + , post(g, u) = {q' \ S u (q,q') > 0}. 

Intuitively, the PBA starts in the initial state q s and if after reading ao, cti • • • , a,i results 
in state q, then it moves to state q' with probability S ai+1 (q, q') on symbol Oj+i. Given a 
word a G the PBA B can be thought of as a infinite state Markov chain which gives 
rise to the standard cr-algebra defined using cylinders and the standard probability measure 
on Markov chains [Var85, KS76] as follows. Given a word a G S w , the probability space 
generated by B and a is the probability space (Q u \Ts,ai A*B,a) where 

• Fs,a is the smallest <T-algebra on Q u generated by the collection {C^ | n G Q + } where 
C v = {p G | 7/ is a prefix of p}. 

• PB,a is the unique probability measure on (Q u ,TB, a ) such that PB,a{^q ...q n ) is 

- if g ± Qs, 

- 1 if n = and go = Qs, and 

- 8(qo,a(0),qi)...6(q n -i,a(n-l),q n ) otherwise. 

A run of the PBA B is an infinite sequence p G A run p is accepting if G Q/ for 
infinitely many i. A run /) is said to be rejecting if it is not accepting. The set of accepting 
runs and the set of rejecting runs are measurable [Var85j. Given a word a, the measure 
of the set of accepting runs is said to be the probability of accepting a and is henceforth 
denoted by p^ c a ', and the measure of the set of rejecting runs is said to be the probability 
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of rejecting a and is henceforth denoted by fi B a . Clearly fi^ c Q + fJ, B a = 1. Following, 
[BG05, BBGQ8], a PBA B on alphabet £ defines two semantics: 

• £>o(0) = {a £ S u Mlra - > *-*}, henceforth referred to as the probable semantics of B, and 

• £=\{B) = {a £ ^ | /ig c< Q, = 1}, henceforth referred to as the almost-sure semantics of B. 
This gives rise to the following classes of languages of infinite words. 

Definition 2.4. Given a finite alphabet £, 

• L(PBA >0 ) = {L C S w 3PBA B. L = £> (B)}; 

• L(PBA =1 ) = {L C S u 3PBA B. L = £=i(B)}. 

Probabilistic Rabin automaton. Analogous to the definition of a PBA and RatPBA, 
one can define a Probabilistic Rabin automaton PRA and RatPRA [BBG08, Gr608j; where 
instead of using a set of final states, a set of pairs of subsets of states is used. A run in 
that case is said to be accepting if it satisfies the Rabin acceptance condition. It is shown 
in [BBG081 IGro08| that PRAs have the same expressive power under both probable and 
almost-sure semantics. Furthermore, it is shown in [BBG08, IGr608| that for any PBA B, 
there is PRA TZ such that a word a is accepted by TZ with probability 1 iff a is accepted 
by B with probability > 0. All other words are accepted with probability by TZ. 

Proposition 2.5 f |BBG08| u For any PBA B there is a PRA TZ such that £ >0 (B) = 
£ >0 {lZ) = C =1 {K) and £= (B) = £= (TZ). Furthermore, if B is a RatPBA then TZ is a 
RatPRA and the construction of TZ is recursive. 

Finite probabilistic monitors (FPM)s. We identify one useful syntactic restriction of 
PBAs, called finite probabilistic monitors (FPM)s. In an FPM, all the states are accepting 
except a special absorbing reject state. We studied them extensively in [CSV081 [CSV09a . 

Definition 2.6. A PBA M = (Q, q s ,Qf,S) on S is said to be an FPM if there is a state 
q r € Q such that q r ^ q s , Q f = Q\ {q r } and 6(q r , a, q r ) = 1 for each a € S. The state q r 
said to be the reject state of Ai. If in addition M is a RatPBA, we say that Ai is a rational 
finite probabilistic monitor (RatFPM). 

3. Probable semantics 

In this section, we shall study the expressiveness of the languages contained in L(PBA >0 ) as 
well as the complexity of deciding emptiness and universality of £>o($) for a given RatPBA 
B. We assume that the alphabet S is fixed and contains at least two letters. 

3.1. Expressiveness. We shall establish new expressiveness results for the class L(PBA >0 )- 

• We show that although the class L(PBA >0 ) strictly contains w-regular languages [BG05J, 
it is not topologically harder. More precisely, we will show that for any PBA B, £>o(i3) 
is a BCI(^)-set. This will be a consequence of following facts. 

(a) L(PBA >0 ) = BCI(L(PBA =1 )) (see Theorem El]) . 

(b) L(PBA =1 ) C g s (see Lemma 02]). 

• However, there are BCI(^) sets that are not in L(PBA >0 ) (see Lemma 13.31) . 
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Figure 2: Relationship between languages recognized by PBAs and sets in the Borel hi- 



erarchy defined by the Cantor topology. Arrows indicate strict containment. 
'Det. Reg.' refers to the class of languages recognized by deterministic Biichi 
automata, while 'Regular' refers to the class of w-regular languages. Contain- 
ment arrows with label 1 were proved in }BG05j and those labelled 2 were proved 
in [BBG08 . Results relating the classes Regular and Det. Reg. are classical 



results; see survey [PP041 ITho90| . Containment arrows with label 3 and the 
equality BCI(L(PBA =1 )) = L(PBA >0 ) are proved in this paper. 

Our expressiveness results are summarized in Figure EJ 

We first show that just as the class of w-regular languages is the Boolean closure of 
the class of w-regular recognized by deterministic Biichi automata, the class L(PBA >0 ) 
coincides with the Boolean closure of the class L(PBA =1 ). This is the content of the following 
theorem whose proof is of independent interest and shall be used later in establishing that 
the containment of languages of two PBAs under almost-sure semantics is undecidable (see 
Theorem 14. 5[) . 

Theorem 3.1. L(PBA >0 ) = BCI(L(PBA =1 )). 

Proof. First observe that it was already shown in [BBG08| that L(PBA =1 ) C L(PBA >0 ). 
Since L(PBA >0 ) is closed under Boolean operations, we get BCI(L(PBA =1 )) C L(PBA >0 ). 
We have to show the reverse inclusion. 

It suffices to sho w that given a P BA B, the language £ >0 (B) € BCI(L(PBA =1 )). Fix B. 
Recall that results of [BBG08, Gr608] (see Proposition ^. 5ft imply that there is a probabilistic 
Rabin automaton (PRA) TZ such that 1) £ >0 (B) = C =1 (TZ) = C >Q (TZ) and 2) C =0 (B) = 
£ =0 (K). Let TZ = (Q, q s , F, 5) where F C 2*3 x 2 Q is the set of the Rabin pairs. Assuming 
that F consists of n-pairs, let F = {(-Bi, Gi), . . . , (B n , G n )}. 

Given an index set IC {1, . . . , n}, let Goodj = U re jG r . Let TZx be the PBA obtained 
from 1Z by taking the set of final states to be Goodj. In other words, TZx = {Q-, Qs, Goodj, 5). 
Given 1 C {l,...,n} and an index j € X, let Badxj = Bj U (U re x,r^jGV). Let TZ X 
be the PBA obtained from TZ by taking the set of final states to be Badjj, i.e., TZ° X = 
(Q,q s , Badjj, <5). The result follows from the following claim. 



Claim: 



U C =1 (TZx)n(^\C =1 (TZ{)). 



ZC{l,...,n},jeZ 
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Proof of the claim: Given X C {1, . . . , n}, j G X, let l x ,j = £=i(R-i) n (S w \ £=i(^)). 
We will say that a run p of PRA 1Z satisfies the Rabin pair (B r , G r ) if all states in B r occur 
only finitely many times in p and at least one state in G r occurs infinitely often in p. 

We first show that Ljj C £ >0 (7Z). Fix any a G \-x,y Since l_xj Q £=i(TZx), it follows 
that on input a the measure of runs that visit the set Goodj = UjgjGj infinitely often must 
be 1. On the other hand, as l_xj fl C=i(TZ x ) = 0, it follows that on input a the measure 
of runs that visit Badjj = Bj U (Ujgx^jGj) only finitely many times has strictly positive 
measure. Since Goodj\ Badjj C it now follows from the previous two observations that 
the measure of runs that visit Gj infinitely often but visit Badxj only finitely many times is 
strictly positive. Since Bj C Badxj, we get that the set of runs that satisfy the Rabin pair 
(Bj,Gj) has non-zero measure on input a. Therefore, we have that l_x,j C £>o(7£). But, 
we have that £>o(72.) = £=i(1Z) = C > q(B). Hence, we get 

U L XJ C £ >0 (B). 

XC{l,...,n},jeX 

We will be done if we can show the reverse inclusion. Thus, given word a in £>o(£>), we 
have to construct X and j such that a G \-i j- We construct them as follows. First, let 
I be the set of all indices r such that the measure of all runs that satisfy the Rabin pair 
(B r ,G r ) on input a is > 0. X is non-empty (since a G C=i(7V))- Clearly, we have that on 
input a, the measure of runs such that Goodj is visited infinitely often is 1 (again, since 

a G £=i(TZ)). In other words, a G C=i(1Zj). Required X will be a subset of X and will be 
constructed by induction as follows. 

At step 1 of the induction, we pick an arbitrary index r in I. Then we check if it is the 
case that on a, the probability of visiting G r infinitely often in 1Z is 1. Note that it is the 
case that the probability that B r is visited infinitely often in 7Z is < 1 (as a satisfies (B r , G r ) 
with non-zero probability). Note that this implies that a G C=i(lZ{ r y) n (S w \ £=i(1Z^ r y) 
and the induction stops at this point. If it is not the case, then let X\ = {r}. 

Proceed by induction. At step m, we would have produced an index set X m C X 
such that on a, we have that a ^ C = i{lZx m ) (meaning the set of runs which visit Goodj m 
infinitely often have probability < 1). Now since a is accepted by PRA 1Z with probability 
1, there must be some index r in X\X m such that the set of runs that satisfy (B r , G r ) and 
visit Good/ m only finitely many times is > 0. Fix one such r. Now, there are two cases. 

(1) On the input a, the set of runs that visit Goodx m U G r infinitely often has measure 1. 
In that case, by construction, we also have that a G \-x m u{r},r an d induction stops. 

(2) Otherwise, we let X m+ \ = X m U {r} and proceed. 

The induction must stop at a finite point at which we will satisfy the required condition 
(since a G C = \(TZj)). □ 

The second component needed for showing that L(PBA >0 ) C BCI^) is the fact that 
for any PBA B and x G [0, 1], the language C> X {B) is a ^5-set; which we prove next. 

Lemma 3.2. For any PBA B and x G [0, 1], £> X (B) is a Qs set. 

Proof. Let B = (Q,q s ,Q f ,5). Now given k > 0, let Paths fe C S w be the set of all infinite 
runs which start at the state q s and visit the set of final states at least fc-times. Let 
Paths w be the set of all infinite runs which start at the state q s and visit the final states 
infinitely often. Formally, Paths'" = {p G Q u \ p[0] = q s and \{i G N | p[i] G Qf}\ > k} and 
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Paths" = {p G Q w | p[0] = q s and |{t G N | p[i] £ Q f }\ = oj}. We have that Path k , k > 
forms a decreasing sequence and 

n fceN ,fc>oPaths fc = Paths". 
From standard probability theory, we get that for any word a, 

lim /i B)Q ,(Paths fc ) = /i Ba ( Paths") 

where /xb iQ: is the probability measure generated by the infinite word a and PBA B. From 
this, we immediately see that an infinite word a is accepted with probability at least x iff 
for all k > the probability of visiting the set of final states on input a at least fc-times 
> x. In other words, 

{a G S" | ^ >x} = n fceNjfc>0 {a G S" | /i BiQ (Paths fc ) > s}. 

Hence, it suffices to show that for each k G N, k > the set {a G X" | /ig iQ (Paths fe ) > x} is 
a ^ set. Note that for each A; > 0, 

{a G S" I p B a (Paths fe ) > x} = n neN {a G E w I z/ B a (Paths fc ) > a; }. 

n 

Hence, it suffices to show that for each /c G N, nGN, £; > the set {a G S" | /ig ja (Paths fc ) > 
x — ^} is a C?-set. In order to see that this is the case, given k > and £ > 0, let Path k,i C S" 
be the set of infinite runs that start at the initial state and visit Qf at least A; times in the 
first I steps. Formally, Paths M = {p G Q" | p[0] = <? s and |{i G N,i < ^ | G > A}. 
Now, the result follows immediately from the observation that 

{a G S" | /i Ba (Paths fc ) > x - -} = U teN {a G S w | p B a (Paths M ) > x - -} 

n n 

and the observation that each of the set {a G S" | pg !Q (Paths fc '^) > x — ^} is a (/-set. □ 

Using Lemma [3T2l one immediately gets that L(PBA >0 ) C BCI^). Even though PBAs 
accept non-w-regular languages, they cannot accept all the languages in BC\(Qs)- 

Lemma 3.3. Regular C L(PBA >0 ) C BC\(Q S ). 

Proof. Note that Regular C L(PBA >0 ) follows immediately from results of [BG05] . Thanks 
to Lemma E21 we also have that L(PBA =1 ) C Q s . Since L(PBA >0 ) = BCI(L(PBA =1 )) 
(see Theorem 13. ip . we get that L(PBA >0 ) C BCI (Cy^) . We only have to show that this 
containment is strict. The proof of this fact utilizes the following result which shows that 
for any L G L(PBA >0 ), the smallest safety language containing L is guaranteed to be ir- 
regular even if L is nottj 

Claim: For any PBA £>, let cl(L) be the smallest safety language containing L = £>o(£>). 
Then cl(L) is w-regular. 

Proof of the claim: Without loss of generality, we can assume that L ^ 0. Let B = 
(Q, q s , Qf,5). Given q G Q, let B q be the PBA which is exactly like B, except that the initial 
state is q. That is B q = (Q,q,Qf,5). Let Q > q C Q be the set of states {q \ 3a.pQ^ a > 0}. 
Consider the finite state Biichi automata A = (Q>o> qs, Q>o, A) where (q±,a, (72) G A iff 



As arbtitrary intersection of safety languages is also a safety language, for every language L, there is a 
smallest safety language containing L. Topologically, this is the closure of L. 
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Emptiness 


Universality 


Containment 


L(PBA >0 ) 


£ 2 l -complete (t) 


E2-complete (t) 


E^-complete (t) 


L(PBA =1 ) 


PSPACE-complete m) 


PSPACE-complete 


E^-complete 



Figure 3: Hardness of decision problems for RatPBAs. ^The problems of checking empti- 
ness, universality and containment for probable semantics was shown to be R.E.- 
hard in [BBG08]. ^Hhe problem of checking emptiness of almost sure semantics 
was shown to decidable in EXPTIME in [BBG08| . 



8{q\,a,q2) > 0. It is easy to see that cl (L) is exactly the language recognized by A. This 
implies that cl(L) is w-regular.(End of the claim) □ 

We proceed as follows. Fix two letters a, b of the alphabet X and consider the language 
L consisting of exactly one word a = abaabb . . . a*6*a 4+1 6 t+1 . . .. Now, cl (L) = L (every single 
element set in a metric space is a closed set) and L is not w-regular (since L does not contain 
any periodic word). Therefore, the closed set L is not in the class L(PBA >0 ) (note that 

Le^asJc g 5 ). □ 



3.2. Decision problems. For the rest of this section, we shall focus our attention on deci- 
sion problems for probable semantics for RatPBAs. Results of this section are summarized 
in the first row of Figure [3] and stated in Theorem l3.9[ Given a RatPBA B, the problems of 
emptiness and universality of C > o(B) are known to be undecidable [BB G08| . We sharpen 
this result by showing that these problems are X^-complete. This is interesting in the light 
of the fact that problems on infinite string automata that are undecidable tend to typically 
lie in the analytical hierarchy, and not in the arithmetic hierarchy. 

Before we proceed with the proof of the upper bound, let us recall an important property 
of finite-state Biichi automata |Tho90[ IPP04| . The lan guage recognized by a finite-state 
Biichi automaton A is non-empty iff there is a final state qj of A, and finite words u and 
v such that qt is reachable from the initial state on input u, and qj is reachable from 
the state qj on input v. This implies that any non-empty w-regular language contains an 
ultimately periodic word. We had extended this observation to FPMs in [CSV081 ICSV09a| . 
In particular, we had shown that the language £ >X (M) is non-empty for a given Ai iff there 
exists a set of final states C of A4 and words u and v such that the probability of reaching C 
from the initial state on input u is > x and for each state q 6 C the probability of reaching 
C from q on input v is 1. This immediately implies that if C >X (M) is non-empty then 
£->x{M) must contain an ultimately periodic word. In contrast, this fact does not hold for 
non-empty languages in L(PBA >0 ). In fact, Baier and Grofier [BG05| . construct a PBA B 
such that £>o(i?) does not contain any ultimately periodic word. 

However, we will show that even though the probable semantics of a PBA may not 
contain an ultimately periodic, they nevertheless are restrained in the sense that they must 
contain a strongly asymptotic word. In order to define strongly asymptotic words formally, 
we introduce the following notation- 
Notation: Let B = (Q, q s , Qf, 5). Given C C Q, q £ C and a finite word u = aoa± . . . a*. G 
E+, let 5$ f (q,C) = Z q > e c S u f (q, q') where 

&M) = E ({(?} u W} u {q . | i < i < k]) nQf + ^ (9, 91)^(91,®) • ••M9*,9')- 
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Informally, in the above notation 8 U f (q, C) is the probability that the PBA B, when started 
in state q, on the input string u, is in some state in C at the end of u after passing through 
a final state. We can now define strongly asymptotic words. 

Definition 3.4. Given a PBA B = (Q, q s , Qf, 5) and a set C of states of B, a word a G S w 
is said to be strongly asymptotic with respect to B and C if there is an infinite sequence 
ii < %2 < .... such that 

(1) $a[0:h](qs,C) > and 

(2) all j > and for all q G C, (q, C) > 1 - i 

A word a is said to be strongly asymptotic with respect to B if there is some C such that a 
is strongly asymptotic with respect to B and C. 

We will now show that if the probable semantics of a PBA is non-empty then it must 
contain a strongly asymptotic word. We need one more notation. 

Notation: Reach(B,C,x) denotes the predicate 3u G T, + .5 u (q s ,C) > x. 
Intuitively, the predicate Reach(B, C, x) is true iff there is some finite non-empty string u, 
such that the probability of being in C having started from the initial state q s and after 
having read u is > x. The existence of strongly asymptotic word in probable semantics is 
an immediate consequence of the following Lemma. 

Lemma 3.5. Let B = (Q,q s ,Q f ,6). For any x G [0, 1), £ >X (B) / iff 3C C Q such that 
Reach(B, C, x) is true and for all j > there is a finite non-empty word Uj such that for all 

q eC.5% f (q,C)>(l-±). 

Proof. (<=) Note that it is a well-known fact that the product rij*Li(l ~~ fr) converges and 
is > 0. Assume now that 3C C Q such that Reach(B, C, x) is true and for all j > there 
is a finite word Uj such that Vg G C .5uf(q,C) > (1 — -—). Since Reach(B,C,x) is true, 
there is a finite word u such that 5 u (q s , C) > x. Fix u. Also for each j > 0, fix Uj such that 

VgGC, andtfg'foC) > (1 - £). 

Let z = 5 u (g s ,C). Let y = f. We have that y < 1. Since IljtiU ~ jl) > and 
y < 1, there is a jo > such that rijlj (l — ^j) > V- Now it is easy to see that the word 
a = uuj Uj 0+ i ... is accepted by B with probability > zy. But zy is x and the result follows. 

(=>) Assume that C >X (B) ^ 0. Fix an infinite input string 7 G £ >:c (i3). Recall that the 
probability measure generated by 7 and B is denoted by fiB,-y- For the rest of this proof we 
will just write fx for /ie i7 . 

We will call a non-empty set of states C good if there is an e > 0, a measurable set 
Paths C Q w of runs, and an infinite sequence of natural numbers i\ < %i < 23 < . . . such 
that following conditions hold. 

• ^i(Paths) > x + e; 

• For each j > and each run p in Paths, we have that 

(a) p[0] = q s , p[ij] G C and 

(b) at least one state in the finite sequence p[ij,ij + \] is a final state. 

We say that a good set C is minimal if C is good but for each q G C, the set C \ {q} is not 
good. Clearly if there is a good set of states then there is also a minimal good set of states. 
Claim: 

• There is a good set of states C. 
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• Let C be a minimal good set of states. Fix e, Paths and the sequence i\ < %2 < ■ ■ ■ which 
witness the fact that C is a good set of states. For each q G C and each j > 0, let Paths^g 
be the subset of Paths such that each run in Pathsj j(? passes through q at point ij, i.e., 
Pathsj i<? = {p G Paths | p[ij] = q}. Then there exists a p > such that /x(Paths Jig ) > p for 
each q G C and each j > 0. 

We first show how to obtain the Lemma using the above claim. Fix a minimal set of good 
states C. Fix e, Paths and the sequence i\ < 22 < • • • which witness the fact that C is a 
good set of states. We claim that C is the required set of states. As ^(Paths) > x + e and 
for each p G Paths, p[i\] G C, it follows immediately that Reach{B , C , x) . Assume now, by- 
way of contradiction, that there exists a jo > such that for each finite word u, there exists 
a q G C such that Su f (q, C) < 1 — ~. Fix jo. Also fix p > such that ^(Pathsj^) > p for 
each j and q £ C, where Paths^q is the subset of Paths such that each run in Paths-, )(? passes 
through q at point i,y, the existence of p is guaranteed by the above claim. 

We first construct a sequence of sets Lj C Q + as follows. Let L\ C Q + be the set of 
finite words on states of Q of length i\ + 1 such that each word in L\ starts with the state 
q s and ends in a state in C. Formally L\ = {r] G Q + \ \rj\ = i\ + 1 , 77 [0] = q s and r][ii] G C}. 
Assume that L r has been constructed. Let L r+ \ C Q + be the set of finite words on states 
of Q of length i r+ \ + 1 such that each word in L r+ \ has a prefix in L r , passes through a final 
state in between i r and v+i, and ends in a state in C. Formally, L r+ \ = {r/ G Q + | |ry| = 
i r+ i + l,r/[0 : i r ] G L r ,3i.(i r < i < i r+ \ A r/[i] G Qf)}- 

Note that (L r S w ) r >i is a decreasing sequence of measurable subsets and Paths C 
n r >i-L r X w . Now, it is easy to see from the choice of jo andp that /^(L r +iS w ) < ^(L r S w ) — 
This, however, implies that there is a ro such that /i(L ro S w ) < 0. A contradiction. Thus, it 
suffices to show that the claim is correct. 
Proof of the claim: 

(1) For each k > 0, let = post(g s ,7[0 : k]). Since the set of states Q is finite, there must 
be some C such that Ct = C for infinitely many fc's. Fix one such C. We claim that 
C is a good set of states. We need to show that C satisfies the definition of good set 
of states. So we need to construct e, Paths and the infinite sequence i\ < 12 < ■ ■ ■ as in 
the definition of good set of states. We will pick e > such that p'g^ = x + 2e. We 
construct Paths and the sequence i\ < 12 < ■ ■ ■ as follows. 

First let Pathso be the set of all runs starting in q s and visiting the final states infinitely 
often. Pathso is measurable and /[/(Pathso) = x + 2e. Take i\ > to be the smallest 
integer such that post(g s ,7[0 : i±]) = C. Inductively, assume that we have constructed 
a sequence of integers i\ < 12 < ■ ■ ■ < ij+i, and a measurable set Pathsj C Pathso such 
that 

(a) ^(Pathsj) > x + e + jj, 

(b) for each p G Paths^ and k < j + 1, p[ik] G C, and 

(c) for each p G Pathsj and k < j + 1, there is some i between and ik+i such that 
p\i] G Q f . 

Observe that Pathso and i\ satisfy that above conditions as condition (c) holds vaccu- 
ously. Now for each £ > 0, Paths^ C Pathsj be the set of runs that visit a final state at 
least one time between and Formally, Pathsj = {p G Paths^ | 3i.(ij + \ < i < 

ij+i + £ A p[i] G Qf)}- Clearly Pathsj is an increasing sequence of measurable sets and 
U^Pathsj = Pathsj (each run in Pathsj visits the set of final states infinitely often). 



POWER OF RANDOMIZATION IN AUTOMATA ON INFINITE STRINGS * 



15 



Since //(Paths.,-) > x + e + ^j, there must exist a 4> such that //(Paths^ ) > x + e + §(■§■)• 
Fix £q and let > ij+i + £o be the smallest integer such that post(g s , 7(0 : ij+2]) = C. 
Let Pathsj+i = Paths^ . It is easy to see that Pathsj+i is measurable and that it satisfies 
the conditions (a), (b) and (c), assumed inductively about Pathsj. 

Observe that the above inductive construction ensures that Paths J+ i C Paths., . Take 
Paths = (IjgNPathsj. It is easy to see that Paths and the sequence %\ < 12 < ••• 
constructed inductively, satisfy the claim. 
(2) We have that C is minimal good set of states. Note that as C is finite, we only need to 
show that for each q € Q, infj>o /i(PathSj )9 ) > 0. We proceed by contradiction. Assume 
that there is some q such that infj>o /i(Paths 3 - g ) = 0. Fix one such q. We will obtain a 
contradiction to minimality if we can show that C \ {q} is also a good set of states. 

In order to show that C \ {q} is a good set of states, we have to satisfy the definition 
of a good set of states. 

Now, since infj>o //(Pathsj )9 ) = 0, there is some j\ such that /z(Pathsj 1)9 ) < f. Let 
Paths 1 = Paths \ Paths^g. We have that Paths 1 C Paths, ^(Paths 1 ) > x + § + f and for 
each p G Paths 1 , p[ij x ] G C \ {q}. 

Now, again as inf J>0 /i(PathSj j(? ) = 0, there is some j'2 > j% such that /i(PathSj 2i g) < |. 
Let Paths 2 = Paths 1 \ Paths 3 - 2) g. We have that Paths 2 C Paths 1 , /i(Paths 2 ) > x + | + | 
and for each p <E Paths 2 , p[ij 2 ] E C \ {q}. Note also that as j'2 > j\ and Paths 1 C Paths, 
we have that for each each p € Paths 2 there is some i such that ij 1 < i < ij 2 and 
p[i] e Q f . 

We can continue and obtain a sequence Paths J1 D Paths- 72 3 ... of measurable sets, 
and sequence ij 1 < ij 2 < . . . such that for each I > 0, //(Paths- 7 *) > x + | + ^ and for 
each p € Paths- 7 ', p[ij L ] 6C \ {q}. Furthermore for each I > 1 and each p € Paths' there 
is some i such that < i < ij l and p[i] € Qf. 

Let Paths' = n^ >0 Paths'. We have that /x(Paths') > x + |. Clearly |, Paths' and the 
sequence ^ a < ij 2 < . . . witness the fact that C \ {g} is a good set of states. □ 

We get immediately that the probable semantics of a PBA, if non-empty, must contain a 
strongly asymptotic word. 

Corollary 3.6. Given a PBA B, £>o(£>) 7^ iff C>o(B) contains a strongly asymptotic 
word. 

Lemma 13.51 also implies that emptiness-checking of C>q(B) for a given a RatPBA B is 
in £°. 

Corollary 3.7. Given a RatPBA, B, the problem of deciding whether £>o(£>) = is in 

Proof. Let us fix a PBA 23 = (Q,q s ,Qf,5), and x G [0, 1). Now Lemma [331 says that the 
non-emptiness of C >X (B) is equivalent to the following property 

ip = 3C C Q. 3u G S*. C) > x)A 

(Vj. 3u, G E*. (Vg G C. 6$f(q,C) > (1 - i))) 

which can be rewritten as (by moving quantifiers out) 

if = 3C C Q. Vj. 3u G E*. 3itj- G E*. ({5 u (q s , C) > x)A 

(VqeC.6$f(q,C)>(l-X))) 
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Now consider the property ip given as 

ip = Vj. 3Cj CQ.3ue £*. 3 Uj G £*. ((<5 u (g s , C,) > a?) A 

(VgGC,-. ^/(g,C,)>(l-i))) 

Clearly, ^ logically follows from ip. However, in our specific case, it turns out that in fact, 
ip is equivalent to ip due to the following observations. First note, that since there are only 
finitely many subsets of Q, there must be a C C Q such that C = Cj for infinitely many 

j (if ip holds). Further observe that if 3 Uj .(Vq G C. 5u/(q,C) > (1 - i)) for some j then 
Q f 1 

3iii.(Vq G C. 5 U< (g, C) > (1 — ^r)) holds for all i < j. From these it follows that ip logically 
follows from ip. 

Of i 

Observe that (5 u (q s ,C) > x) and (Vg G C. 5 U] (q, C) > (1 — ^7)) are recursive predicates. 
Thus, ip demonstrates that the non-emptiness problem is in II!], which means that emptiness 
is in £§. " □ 

We will now show that the emptiness problem is also Sg-hard. 

Lemma 3.8. Given a RatPBA, B, the problem of deciding whether C > q(B) = is S^-hard. 

Proof. The hardness result will be obtained by significantly modifying the proof in [BBG08 , 
where the emptiness problem was shown to be R.E.-hard. 

Consider a deterministic two counter machine M with two counters and a one way, read 
only input tape. We can capture the computation of M, as a sequence of configurations 
where each configuration is a 4-tuple {q,x,a l ,m) where q is the state of the finite state 
control that M changed to, x is the input symbol that is read and i,j are the new counter 
values and m indicates whether the input head stayed in the same place, or moved right 
and read a new input symbol. Here m G {same, right}. Note that the two counter values 
are represented in unary having a string of as and 6s, respectively. Thus, a computation of 
M can be described by a string over alphabet £' that includes the states of M, the input 
symbols of M, the symbols a, b, same, right, (, ), and in this proof we will restrict our 
attention to machines M that read all the input symbols; thus, the number of steps in a 
computation is at least the length of the input. A halting computation is a sequence of 
configurations ending in a halting state. Define L(M) to be the set of input strings on 
which M halts. Let (M) be a binary encoding of M. Consider U = {(M) | L(M) / 0} and 
V = {(M) I L(M) is finite}. Recall that % is R.E.-complete and V is S^-complete. 

Our proof of hardness will be as follows: Given a deterministic two counter machine M, 
we will construct three RatPBAs Vi,V 2 , and V 3 such that (M) G V iff £> (Pi) nC >0 (V 2 ) n 
£>o(^ > 3) = 0- Since L(PBA >0 ) is closed under intersection and the intersection of automata 
can be effectively constructed [BBG08, Gr608], this will demonstrate a reduction from T> 
to the emptiness problem and therefore prove the hardness result. Our construction of the 
RatPBAs Vi,V 2 , and V 3 relies on ideas in |CL89j and [BBG081 [Gro08] . Hence, we begin 
by recalling the key ideas from these papers that we will exploit. 

For the rest of this proof, let us fix a deterministic two-counter machine M whose 
computations can be encoded as strings over Consider any rational e such that < 
e < ^. [CL89] give the construction of a PFA 7Z (that depends on M and e) over alphabet 
Eft = £' U {@}, where @ G" £'. We can show that this PFA K satisfies the following 
properties. 

(1) There exists an (computable) integer constant d > 2 such that if w is a valid and 
halting computation of M of length n, then the input string (w@) is accepted by 1Z 
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with probability > (1 — e); that is, the string obtained by concatenating w, d n number of 
times, where successive concatenations are separated by @, is accepted with probability 
at least 1 — e. 

(2) Consider any input u = wi@W2@ ■ ■ ■ @w m @, where no itfj is a valid halting computation 
of M. TZ accepts u with probability at most e. 

The proof that TZ satisfies these properties is defered to the Appendix. Let 
denote the probability with which the input w is accepted by TZ. Observe that the above 
construction has the following property: if L(M) = then any input string is accepted by 
TZ with probability at most e; on the other hand, if L(M) ^ then there is some input that 
is accepted with probability at least 1 — e. 

Using the above construction of TZ, |BBG081IGro08] reduce Ti to the emptiness problem, 
thus demonstrating its R.E. -hardness. The main ideas behind this are as follows. Let 
S-p = U {ft, $}, where (j and $ are symbols not in E^. [BBG081 IGro08j construct two 
RatPBAs V\ and V2 over alphabet S-p such that £>oCPi) is 

KKft • • • <$$™f>i • • • <$$ • • • K G Sfc and - ( II C 1 - Z^))) > 

j>i i=i 

and C > q(V2) is 

{vi$$u 2 $$ • • • : Wi E (E U {«})* and J](l - (1 - e) 9 ^) = 0} 

i>l 

where g(vi) is the number of ft symbols in Wj. Let L\ = £ > o('Pi) and L2 = C->o{T > 2)- The 
following two observations are shown in [BBG08, Gr608 . 

(1) Consider any input w = wlftw^'ft ■ ■ ■ w\ %%w\ ■ ■ ■ w^ 2 $$ • • • , where w\ E and 

acc < e . if ^ £ then w &L\. 

(2) Suppose u>i , , . . . are (not necessarily distinct) words over £7^ such that ^q^ w . > 1 — e. 
For any e < ^ , there are ki , k% , &3, . . . such that 

(witt) fcl - 1 u;i$$(u; 2 tt) fe2 " 1 w;2$$--- 

belongs to ^! n L 2 . 

Observe that the above two observations allow [BBG08 , Gr608 to conclude that L1P1L2 7^ 
iff there is some w such that fJ>^° w > 1 — e. Thus, using properties of TZ, one can see that 
Ti can be reduced to the emptiness problem, therefore demonstrating its undecidability. 

In order to prove the tighter lower bound of we would like to extend the above 
ideas to obtain a reduction from T>, instead of Ti. We first outline the intuitions behind 
the extension. Suppose u\,U2, ■ ■ ■ are (not necessarily distinct) halting computations of M. 
Consider the input word 

w(h,k2, ...) = (( U i@/ 1 tt) fcl - 1 (n 1 @/ 1 $$(( U2 @/ 2 tD fc2 ~ 1 (n 2 @/ 2 $$ • • • 

where = d) Ui \. From the preceding paragraphs it can be seen that there is a choice of 
ki, k2, ■ ■ ■ such that w{k\, k2, • • •) £ L± f] L 2 . To obtain a reduction from T>, we need to 
"check" that infinitely many among the computations m,U2, ■ ■ ■ correspond to distinct in- 
puts. To do this we will construct a third RatPBA V3 that will check that the computations 
Ui grow unboundedly. Since M is deterministic, passing the test imposed by V3 ensures 
that L(M) is infinite, and conversely, if L(M) is infinite then our assumption that M reads 
all the input symbols ensures that there will be some string that passes the V3 test. 
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a; A a; 1 




Figure 4: Automata V3. Here * indicates any input symbol, b any input symbol that is not 
$, and a any input symbol that is not @, ft or $. 

We now outline the formal details. The RatPBA V3 has S-p as input alphabet and 
is shown in Figure |U It has four states so, si, S2, s r where so is the initial state and the 
only final state. s r is an absorbing state. The transition probabilities depend on a pa- 
rameter A that we will fix later. In state sq, on inputs other than the machine 
transitions to so with probability A and s± with probability 1 — A; on @, ft, $ it goes to 
s r with probability 1. In state s% it behaves as follows. On input @, # it goes to state 
So with probability 1; on input $ it goes to S2; on all other inputs, it remains in si with 
probability 1. In state S2 it behaves as follows. On input $ it goes to state so with 
probability 1; on all other inputs, it goes to s r with probability 1. We will pick A to be 
such that A • d <C ^; recall that d is the constant associated with 1Z. Let SeqComp = 
{uqXqU\Xi ■ ■ ■ I Xi G {@, #, $$} and @, #, $ do not appear in the strings Uj}. It can be eas- 
ily shown that £>o{Ps) Q SeqComp. In addition, consider a = uoXqu^xi ••• G SeqComp 
with Xi G {@, $$} and @, $ not in Uj. If there is an t such that for infinitely many i, 
\ui\ = t then a £ >C>o('p3)- We conclude the proof by showing the following claim. 
Claim: L(M) is a finite set iff £ >0 (Vi) n £ >0 (V 2 ) n C^Vz) = 0. 

Proof of the claim: Let C(M) be the set of valid halting computations of M. Since we 
assume that M is deterministic and reads all the input symbols, we can conclude that L(M) 
is finite iff C(M) is finite. Suppose L(M) is finite and a G £>o{V2) H >C>o('P3). Since a is 
accepted by P3, we know that the computations in a grown unboundedly. However, since 
C(M) is finite, we can conclude that there is a suffix j3 of a such that none of computations 
of M in /3 are valid and halting. Thus, j3 = u^tl^tt ' ' ' Wki§§ w i$ w 2 ' ' ' w k 2 $$ ' ' ' suc ^ * na * 
j < e. Coupled with the fact that a G £>o("P2)i we can argue that a ^ £>q("Pi) using 

a similar reasoning as in [BBG08J. 

Suppose L(M) is an infinite set. Hence C(M) is also an infinite set. Let Ui,Uz, ... be 
some distinct computations in C(M); we will describe how to choose Ui later. As before, 
consider 

W(h,k 2 , ...) = ((U 1 @)^tj) fcl - 1 («l@) 4 $$((«2@) & ») fa - 1 («2@) £2 $$ • • • 

where £i = d^. As mentioned before, there are fei, k 2 , ■ ■ ■ such that w(ki, k 2 , ■ ■ •) is accepted 
by both V\ and V 2 . In addition, the probability that V-$ accepts w(ki, k 2 , . . .) is ni>oP* 
where pi = (1 — A' Ui ')^' fci . We will choose u\ (or rather its length) to be a computation so 
that pi is > (1 — i); this will ensure that Ili>o^ ^ s non-zero. Assuming A to be very small 
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and substituting for li = w Ui \ it is easily seen that pi > (1 — {d ■ X)\ Ui \) ki . Here d,ki are 
fixed and A is a small constant such that d ■ A *C i. Now, it should be easy to see that we 
can chose a sufficiently long halting computation Uj so that (l-(d-A)l^l)^ > □ 

Since the class L(PBA >0 ) is closed under complementation and the complementation 
procedure is recursive [BBG08J for RatPBAs, we can conclude that checking universality of 
C > q{B) is also X^-complete. The same bounds also apply to checking language containment 
under probable semantics. Note that these problems were already shown to undecidable in 
[BBG08J, but the exact complexity was not computed therein. 

Theorem 3.9. Given a RatPBA, B, the problems 1) deciding whether £>o(i3) = and 2) 
deciding whether £>o(£>) = E^, are S^-complete. Given another RatPBA, B', the problem 
of deciding whether £>o(£>) C £ >0 (£>') is also Incomplete. 

Proof. Since L(PBA >0 ) is closed under complementation and the complementation is re- 
cursive [BBG08, Gr608l, Lemma 13.81 immediately implies that the problems of universality, 
emptiness and set containment are SS^hard. Also observe that given B\ and B 2 , we have 
that £ >0 (£i) C £ >0 (B 2 ) iff £> (Bi) n (S w \£ >Q (H 2 )) = 0. Now, results of [BBG081 IGto08] 
show that there is a constructible B3 such that £>o(£>3) = £>o(£>i) H (T,^ \ £>o(£>2)). Now, 
thanks to Corollary 13 .7\ the problems of universality, emptiness and set containment are in 

Eg. □ 

Remarks 3.10. Lemma 13.51 can be used to show that emptiness-checking of C^i(B) for 

a given RatPBA B is in £!]. I n contrast, we had shown in [CSV09aJ that the problem of 
deciding whether C > 1 (A4) = S w for a given FPM A4 lies beyond the arithmetical hierarchy. 



4. Almost-sure semantics 

The class L(PBA =1 ) was first studied in [BBG08], although it was not characterized topo- 
logically. In this section, we study the expressiveness and complexity of the class L(PBA =1 ). 
We will also demonstrate that the class L(PBA =1 ) is closed under finite unions and inter- 
sections. As in the case of probable semantics, we assume that the alphabet S is fixed and 
contains at least two letters. 

4.1. Expressiveness. In this section, we shall establish new expressiveness results for the 
class L(PBA =1 )- 

• L(PBA =1 ) C Q 5 . This is an immediate consequence of Theorem 13.11 Lemma 13.21 and 
Lemma 13.31 

• Regular n L(PBA =1 ) = Regular n Deterministic (see Proposition I4.1j) . 

• The class L(PBA =1 ) is closed under union and intersection, (see Corollary 14.31) . 

We start by characterizing the intersection RegularnL(PBA =1 ). Note that the fact every lan- 
guage L(PBA =1 ) is contained in Q$ implies immediately that there are w-regular languages 
not in L(PBA =1 ). That there are w-regular languages not in L(PBA =1 ) was also proved 
in [BBG08] , although the proof therein is by explicit construction of an u;-regular language 
which is then shown to be not in L(PBA =1 ). Our topological characterization of the class 
L(PBA =1 ) has the advantage that we can characterize the intersection Regular nL(PBA =1 ) 
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exactly: Regular n L(PBA ) is the class of w-regular languages that can be recognized by 
a finite-state deterministic Biichi automaton. 

Proposition 4.1. For any PBA B, C=i(B) is a Qg set. Furthermore, Regular(~iL(PBA =1 ) = 
Regular n Deterministic and Regular n Deterministic C L(PBA =1 ) C Qg = Deterministic. 

Proof. Lemma 13.21 Theorem 13.11 and Lemma 13.31 already imply that L(PBA =1 ) C Qg = 
Deterministic. We only need to show that Regular n L(PBA =1 ) = Regular n Deterministic. 
Since every language in L(PBA =1 ) is deterministic, we get immediately that Regular n 
L(PBA =1 ) C Regular n Deterministic. For the reverse inclusion, note that every w-regular, 
deterministic language is recognizable by a finite-state deterministic Biichi automaton. It 
is easy to see that any language recognized by a deterministic finite-state Biichi automaton 
is in L(PBA =1 ). The result follows. □ 

A direct consequence of the characterization of the intersection Regular n Deterministic 
is that the class L(PBA =1 ) is not closed under complementation as the class of w-regular 
languages recognized by deterministic Biichi automata is not closed under complementa- 
tion. That the class L(PBA =1 ) is not closed under complementation is also observed in 
[BBG08], and is proved by constructing an explicit example. However, even though the 
class L(PBA =1 ) is not closed under complementation, we have a "partial" complementa- 
tion operation — for any PBA B there is another PBA B' such that £>o(£>') is the comple- 
ment of C=i(B). This also follows from the following results of [BBG08] as they showed 
that L(PBA =1 ) C L(PBA >0 ) and L(PBA >0 ) is closed under complementation. However 
our construction has two advantages: 1) it is much simpler than the one obtained by the 
constructions in [BBG08J, and 2) the PBA B' belongs to the restricted class of finite prob- 
abilistic monitors FPMs (see Section [2] for definition of FPMs). This construction plays a 
critical role in our complexity analysis of decision problems. 

Lemma 4.2. For any PBA B, there is an FPM M such that C=i{B) = S w \ £ >0 (M). 

Proof. Let B = (Q,q s ,Qf,6). We construct M. as follows. First we pick a new state q r , 
which will be the reject state of the FPM A4. The set of states of A4 would be Q U {q r }- 
The initial state of Ad will be q s , the initial state of B. The set of final states of KA will be 
Q, the set of states of B. The transition relation of A4 would be defined as follows. If q is 
not a final state of B then the transition function would be the same as for B. If q is an final 
state of B then A4 will transit to the reject state with probability \ and with probability | 
continue as in B. Formally, M. = (Q U {q r }, q s , Q, 6m) where 6m is defined as follows. For 
each a G S, q,q' G Q, 

• 6 M {q,a,q r ) = \ and 6 M {q,a,q') = \6{q,a,q') if q E Qf, 

• 6M(q,a,q r ) = and 6M(q,a,q') = 6(q,a,q') if q G Q \Qf, 

• 6 M (q r ,a,q r ) = 1. 

It is easy to see that a word a E is rejected with probability 1 by M. iff it is accepted 
with probability 1 by B. The result now follows. □ 

The "partial" complementation operation has many consequences. One consequence is 
that the class L(PBA _1 ) is closed under union. The class L(PBA _1 ) is easily shown to be 
closed under intersection. Hence for closure properties, L(PBA =1 ) behave like deterministic 
Biichi automata. Please note that closure properties were not studied in [BBG08J. 

Corollary 4.3. The class L(PBA =1 ) is closed under finite union and finite intersection. 
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Proof. Let B\ = (Q 1 , q\, Qj, 5 1 ) and B 2 = (Q 2 , q 2 s , Q 2 , 5 2 ) be two PBAs, and we assume 
without loss of generality that Q 1 (1 Q 2 = 0. We will present construction of PBAs that 
recognize the union and intersection of these languages under the almost sure semantics. 

We begin by first considering the construction for union. Now by Lemma 14.21 we know 
that there are FPMs Mi and M 2 such that C =1 (Bi) = S w \ C >0 (Mi). Now, we had 
shown in |CSV09a] that there is a FPM A4 = (Q,q s ,Qf,S) such that for any word a, 
Mm% = VMua x ^M2,a- lt is eas y to see that C >0 (M) = C^Mx) n C >0 (M 2 ). 

Now, the FPM Ai can be easily "complemented". If q r is the reject state of A4, 
then consider the PBAjW = (Q,q s ,{q r },5); clearly C =1 (M) = S w \ £ >0 (M). Thus, by 
DeMorgan Laws, C =1 (M) = £=i(Bi) U £=i(B 2 )- 

The PBA recognizing the intersection of the languages recognized by B\ and B 2 with 
respect to almost-sure semantics does the following: on an input a, with probability \ it 
runs B\ on a, and with probability ^ it runs B 2 . Clearly, such a machine will accept (with 
respect to almost-sure semantics) iff both B\ and B 2 accept. Formally, B = (Q,q s ,Qf,6) is 
given by 

• Q = Q 1 UQ 2 U {q s } where q s # Q 1 U Q 2 

• Q f = Q}uQ 2 

• The transition relation 5 is defined as follows 

— For q € Q l , S(q s ,a,q) = ^(q^^q), and for q G Q 2 , S(q s ,a,q) = ±5 2 (q 2 ,a,q) 

- For q,q' € Q l , 6(q,a,q') = 5 1 (q,a,q') and for q, q' E Q 2 , 5(q,a,q') = 5 2 (q,a,q'). □ 

4.2. Decision problems. For the rest of this section, we shall focus our attention on 
decision problems for almost sure semantics for RatPBAs. Results of this section are sum- 
marized in the second row of Figure [3] on page [12] and proved in Theorem 14.41 and Theorem 
14.51 The problem of checking whether C=±(B) = for a given RatPBA B was shown to 
be decidable in EXPTIME in [BBG08] . where it was also conjectured to be EXPTIME- 
complete. The decidability of the universality problem was left open in [BBG08]. We can 
leverage our "partial" complementation operation to show that a) the emptiness problem is 
in fact PSPACE-complete, thus tightening the bound in T3BG08J and b) the universality 
problem is also PSPACE-complete. 

Theorem 4.4. Given a RatPBA B, the problem of deciding whether C=i{B) = is 
PSPACE-complete. The problem of deciding whether C =1 {B) = is also PSPACE- 
complete. 

Proof. (Upper bounds.) We first show the upper bounds. The proof of Lemma [4.21 shows 
that for any RatPBA B, there is a RatFPM A4 constructed in polynomial time such that 
C=i(B) = £ w \£>o(A4). C=\(B) is empty (universal) iff £>o(A-l) is universal (empty respec- 
tively). Now, we had shown in |CSV08[ IC~SV09a| that given a RatFPM M, the problems of 
checking emptiness and universality of jC > q(A4) are in PSPACE, thus giving us the desired 
upper bounds. 

(Lower bounds.) We had shown in |CSV08[ ICSV09aj that given a RatFPM M, the 
problems of deciding the emptiness and universality of £>o(A4) are PSPACE-hard respec- 
tively. Given a RatFPM M. = (Q,q s ,Qo,o~) with q r as the absorbing reject state, consider 
the PBA M. = (Q, q s , {q r }, 5) obtained by considering the unique reject state of M. as the 
only final state of M. Clearly we have that £ >0 (M) = S w \ C =1 (M). Thus C >0 (M) is 
empty (universal) iff C = i(A4) is universal (empty respectively). The result now follows. □ 
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Figure 5: Complexity of decision problems for RatHPBAs. 

Even though the problems of checking emptiness and universality of almost-sure se- 
mantics of a RatPBA are decidable, the problem of deciding language containment under 
almost-sure semantics turns out to be undecidable, and is indeed as hard as the prob- 
lem of deciding language containment under probable semantics (or, equivalently, checking 
emptiness under probable semantics). 

Theorem 4.5. Given RatPBAs, B\ and B 2 , the problem of deciding whether £=i(Bi) C 
C=i(B 2 ) is Xl^-complete. 

Proof. Observe first that given RatPBAs B\ and B2, there are (constructible) RatFPMs 
Mi and M 2 such that £=i(Bj) = S w \ £ >0 (Mi) for i = 1,2 (see Lemma S2D- Thus, 
£=i(Bi) C £=i(B 2 ) iff £>o(M 2 ) C C =1 (Mi). The upper bound then follows from the 
upper bound of the containment of PBAs under probable semantics. 

The lower bound is shown by a reduction from emptiness-checking of probable seman- 
tics. Recall from the proof of the fact that L(PBA >0 ) = BCI(L(PBA =1 )) (Theorem 
that given a RatPBA B, there are RatPBAs B±, B^ , ■ ■ ■ B^ and £>f, B^ . . . £>~ such that 

£ >0 (B) = |J £ =1 (,B+) n(£"\£ =1 (0r)). 

l<i<m 

Furthermore, the construction in the proof of Theorem 13. II and results of [Gr608] (Proposi- 
tion 12.51 and the fact that complementation of probable semantics is a recursive operation 
for RatPBAs) implies that Bf and B^~ are constructible. Now, £>o(£>) = iff for each i, 
C = \{Bf) n (S w \ C=i(B^)) = 0. The lower bound now follows from the observation that 
C =1 (B+) n (E« \ C =1 {Br)) = iff <C =1 (B+) C C =1 (B7). □ 

5. Hierarchical PBAs 

We now identify a simple syntactic restriction on PBAs which- 

• under probable semantics coincide exactly with w-regular languages, and 

• under almost-sure semantics coincide exactly with w-regular deterministic languages. 

We will also establish complexity of decision problems of emptiness and universality for the 
case when transition probabilities are given as rational numbers. The complexity results 
are summarized in Figure [5) 

Intuitively, a hierarchical PBA is a PBA such that the set of its states can be stratified 
into (totally) ordered levels. From a state q, for each letter a, the machine can transition with 
non-zero probability to at most one state in the same level as q, and all other probabilistic 
transitions go to states that belong to a higher level. Formally, 

Definition 5.1. Given a natural number k, a PBA B = (Q, q s , Q, 5) over an alphabet S is 
said to be a k-level hierarchical PBA (7c-HPBA) if there is a function rk : Q — > {0, 1, . . . , k} 
such that the following holds. 
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Given j e {0,1,..., k}, let Qj = {q G Q | rk(Q) = j}. For every q G Q and 
a G E, if jo = r k(<?) then post(g, a) C Uj <£<kQe and |post(g, a) D Q J0 | < 1. 

The function rk is said to be a compatible ranking function of i3 and for q G Q the natural 
number rk(g) is said to be the rank or level of q. B is said to be a hierarchical PBA (HPBAJ if 
i3 is /c-hierarchical for some k. If is also a RatPBA, we say that B is a rational hierarchical 
PBA (RatHPBA). 

We can define classes analogous to L(PBA >0 ) and L(PBA =1 ); and we shall call them 
L(HPBA >0 ) and L(HPBA =1 ) respectively. Before we proceed to discuss the probable and 
almost-sure semantics for HPBAs, we point out two interesting facts about hierarchical 
HPBAs. First is that for the class of w-regular deterministic languages, HPBAs like non- 
deterministic Biichi automata can be exponentially more succinct. 

Proposition 5.2. Let E = {a,b,c}. For each n G N, there is a w-regular deterministic 
language L n C E w such that i) any deterministic Biichi automata for L n has at least 0(2") 
number of states, and ii) there are HPBAs B n s.t. B n has 0(n) number of states and 
L n = C =1 (B n ). 

Proof. Given n G N, let L n be the safety language in which for every a there is a c after 
exactly n-steps. In other words, L n = E w \ (E*aE ri {a, 6}E W ). This could model, for instance, 
the property "every request a is answered after exactly n-steps" . We can build a determin- 
istic Biichi automaton for L n and the number of states of such a automaton is 0(2"). We 
could build a HPBA B n with O(n) state such that L n = £=i(B n ). The HPBA B n will be an 
FPM also. The construction of B n is as follows — B n scans the input and upon encountering 
a, B n decides with probability \ to check if there is a c after n steps and with probability 
^, B n decides to continue scanning the rest of the input. In the former case, if the check 
B n reveals an error then B n rejects the input; otherwise B n accepts the input. □ 

The second thing is that even though HPBAs yield only w-regular languages under both 
almost-sure semantics and probable semantics, we can recognize non-w-regular languages 
with cutpoints. 

Proposition 5.3. There is a HPBA B such that both C > i{B) and jC^i(B) are not u- 

— 2 > 2 

regular. 

Proof. The HPBA we will construct will actually an FPM. The following construction is 
given in |GSV08j . Let E = {0,1}. Let Q = {<?o, <7i, <7r} and 5 : Q x E x Q [0, 1] be 
defined as follows. The states q r and q± are absorbing, i.e., 5(q r ,0,q r ) = 5(q r ,l,q r ) = 
5(qi,0,qi) = 8(qi,l,q%) = 1. Transitions out of qo satisfy 5(qo,0,qo) = 5(qo,0,q r ) = 
5(qo, 1, go) = 5(qo, 1> <Zl) = \- Consider the FPM M\d = (Q, qo, {qo, qi}, 5). M\d can be seen 
to be 2-hierarchical with rk(go) = 0, rk(gi) = 1 and rk(q r ) = 2. Given a = aoa\ . . . , it can 
be shown that a = bin (a) where bin (a) is the real number: ^ n^mja,) w j iere num (o) 
is the integer and num(l) is the integer 1. 

Now, consider the FPM A4\d o A4|d constructed as follows. The states of this FPM are 
{?o ; Qi} x {qo, Qi} U <Zr ne „- The initial state is (go, Qo) and the reject state is q rnew - The transi- 
tion probabilities, from the state (q^, q^) on input a G {0, 1} is defined as follows — to state 
{Q121Q02) the transition probability is {(^,0,^) x 5(qj 1 , a, qj 2 ) and to state gv new the tran- 
sition probability is 1 - J2i 2 ,j 2 e{o,i} %ii, a ,% 2 ) x ^iq^, a, q j2 ). The state gv new is absorbing. 



21 



R. CHADHA, A. P. SISTLA, AND M. VISWANATHAN 



The FPM .Mid o A4|d can be seen to be hierarchical with rk(gj 1 ,g , j 2 ) = i\ + %i- Further- 
more, it can be shown that on word a, ^"m^oMu a = (bin(a)) 2 . Thus, C > i(M\d o .Mid) = 

{a | bin(a) > \J\} and £>i(Mid °Mid) = {a | bin(a) > y|} ; both of which are not 
w-regular. □ 

Remarks 5.4. We will see shortly that the problems of deciding emptiness and universality 
for a HPBA turn out to be decidable under both probable and almost-sure semantics. 
However, with cutpoints, they turn out to be undecidable. The latter observation is out of 
scope of the paper. 



5.1. Probable semantics. We shall now show that the class L(HPBA ) coincides with 
the class of w-regular languages. In [BG05j , a restricted class of PBAs called uniform PBAs 
was identified that also accept exactly the class of w-regular languages. We make a couple of 
observations, contrasting our results here with theirs. First the definition of uniform PBA 
was semantic (i.e., the condition depends on the acceptance probability of infinitely many 
strings from different states of the automaton), whereas HPBA are a syntactic restriction 
on PBA. Second, we note that the definitions themselves are incomparable in some sense; 
in other words, there are HPBAs which are not uniform, and vice versa. Finally, HPBAs 
appear to be more tractable than uniform PBAs. We show that the emptiness problem for 
L(HPBA >0 ) is NL-complete. In contrast, the same problem was demonstrated to be in 
EXPTIME and co-NP-hard |BG05j for uniform PBAs. 

We first establish that every w-regular language can be recognized by a hierarchical 
PBA; this is the content of the next Lemma. 

Lemma 5.5. For every w-regular language L, there is a hierarchical PBA B such that 
l = C >0 (B). 

Proof. Let 1Z = (Q,q s ,F,A) be a deterministic Rabin automaton recognizing L, where 
F = {(-Bi, Gi), . . . (Bk,Gk)}- The hierarchical PBA will, intuitively, in the first step choose 
the pair (Bi, Gi) that will be satisfied in the run, and then ensure that the measure of paths 
that visit Bi infinitely often is 0. Formally, B = (Q 1 , q' s , Q'j, 5') is given as follows. 

• Q' = {q's, Qr} U ({1, ...k}xQ), where q' s ,q' r & Q 

• Q' f = UL } mxGi) 

• The transition relation 5' is given by 

- ^ (q's, a, (hi)) = I iff (q s ,a,q) G A 

- For q £ B h 5 f ((i, q),a, (i, q')) = 1 iff (q, a, q') £ A 

— For q G Bi, S'((i, q), a, q' r ) = \ for all a G E, and 5'((i, q),a, (i, q')) = i iff (g, o, q') G A 

— Finally, 5'(q' r ,a,q' r ) = 1 for all a G S. 

It is easy to see that C > q(B) = L. Finally, we point out that B is a k + 1-level hierar- 
chical PBA. This is witnessed by the ranking function rk defined as follows — r k(q' s ) = 0, 
rk((i, q)) = i, and rk(^) = k + 1. □ 

Theorem 5.6. L(HPBA >0 ) = Regular. 

Proof. Thanks to Lemma 15.51 we need to show that every language in L(HPBA >0 ) IS UJ- 
regular. The other inclusion follows from the following Claim. 
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Claim: For any hierarchical PBA B = (Q,q s ,Qf,S) and any word a G a G £>o(£>) iff 
there is an infinite sequence of states q s = qo,q\, ■ ■ ■ such that qi G Qf for infinitely many 
ieN, 5(qi, a[i], qi+i) > for alH € N and 3j > such that 5(qi, a[i], qi+\) = 1 for all i > j. 

Proof of the claim: Let 8bea A;-level hierarchical PBA with compatible ranking function 

rk. Let Qj = {q G Q \ rk(q) = j}. The proof will proceed by induction on the level k. 

Base Case: Suppose k = 0. Based on the definition of hierarchical PBAs, this means 

that B is a deterministic Biichi automaton, i.e., for all q,q' G Q and a G S, either 5(q, a, q') = 

1 or 5(q, a, q') = 0. Thus, the claim clearly holds in this case. 

Induction Step: Let a G S w be such that a G £>o(£>), with = x > 0. Observe 

that for every i, \post(q s , a[0, i]) n Qo\ < 1- There are two cases to consider. 

Case 1: Suppose |post(g s , a[0, i]) n Qol = 1 f° r an let us denote the unique state in 
post(g s , a[0, i]) n Qo by Qi- Suppose in addition, there is a j such that for all t > j, 
5(qe,a[£],q£ + i) = 1. Then clearly the sequence 9o>9i)-- - satisfies the conditions of the 
lemma. 

Case 2: Suppose Case 1 does not hold. Then there are two possibilities. The first possibility 
is that there is a Iq such that post(g s , a[0, io}) fl Qq = 0. The second possibility is that for 
every j, there is a £ > j such that a[£], qt+i) < 1, where once again we are denoting 
the unique state of Qo i n post(q s ,a[0,£]) by In this second subcase, there must then 
exist an io such that S u (q s ,qi ) < x, where u = a[0,io]. 

Now, based on the definition of io given for the two subcases above, it must be the case 
that for some state q £ post(q s , a[0, io])\Qo, the measure of accepting runs from q on the 
word a[io + l]a[io + 2] • • ■ is non-zero. Consider the hierarchical PBA B' = (Q 1 , q, Q'j, 5'), 
where Q' = Q \ Qo, Q'j = Qf \ Qq and 5' = £|q' X £xQ'- Clearly, B' is a k — 1-level 
hierarchical PBA, and thus by induction hypothesis, the string a[io + l]a[io + 2] • • • has 
a run q = q' q[ . . . satisfying the conditions in the claim. The desired run for a (in PBA 
B) satisfying the conditions in the lemma is obtained by concatenating a run from q s to 
q on a[0, io] with q' q[ (End proof of claim). □ 

We now proceed with the main theorem. Let B = (Q,q s ,Qf,5). We will construct a 
finite-state nondeterministic Biichi automaton A = (Q', q' s , Q'f, A'), such that the language 
recognized by A is exactly £>o(£>). Intuitively, the set of states Q' will consist of two copies 
of Q — Q x {0} and Q x {1}. In the first copy, we will simulate the possible transitions 
between pair of states of B (we ignore the exact transition probabilities of B) . In the second 
copy, we will only simulate deterministic transitions of B, i.e., those transitions between 
pair of states which happen with probability 1. From the first copy, we can transit to the 
second copy if the probability of transiting between the corresponding states in B is non- 
zero. From the second copy, we will never transit to the first state. The set of final states 
of A are those states in second level that correspond to the final states of B. Intuitively, 
the construction ensures that if a G L>o(£>), and the sequence q s = qo,q\, . . . and natural 
number j > are such that 

(1) qg G Qf for infinitely many £, 

(2) < 5(qi,a[i],q i+ i) < 1 for all i < j and 5(qi,a[i],q i+ i) = 1 for all i > j 

then (qo, 0), . . . (qj, 0), (qj+i, 1), (qj+2, 1) • • • is an accepting run of A on input a. 

Formally, Q' is the set Q x {0, 1}, q' s = (q s , 0), Q'j = {(q, 1) | q G Qf}, and A' is defined 
as follows. For each (71,(72 G Q, 
• ((ft, 0), a, (q 2 ,0)) G A' iff 5( qi ,a, q 2 ) > 0. 
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• ((gi.O), a, (92, 1)) G A' iff %i,a, 92) > 0. 

• ((gi, 1), a, (ga, 1)) G A' iff % a , a, 9a ) = 1. 

• (((ft, l),o, (92,0)) 6 A' iff never. 

The claim above immediately implies that £>o(£>) is the language recognized by A and 
hence is w-regular. □ 

We will show that the problem of deciding whether C > q(B) is empty for hierarchical 
RatPBA's is NL-complete while the problem of deciding whether £>o(£>) is universal is 
PSPACE-complete. Thus "algorithmically" , hierarchical PBAs are much "simpler" than 
both PBAs and uniform PBAs. Note that the emptiness and universality problem for finite 
state Biichi-automata are also NL-complete and PSPACE-complete respectively. 

Theorem 5.7. Given a RatHPBA, B, the problem of deciding whether C > q{B) = is 
NL-complete. The problem of deciding whether C > q{B) = S w is PSPACE-complete. 

Proof. (Upper Bounds). First note since B is hierarchical, the language £ > o(B) is oj- 
regular (see Theorem 15. 6p . The proof of Theorem 15.61 also allows us to construct a finite- 
state Biichi automata A such that a) £>o(£?) is the language recognized by A and b) the 
size of the automaton A is at-most twice the size of the automaton B. Furthermore, the 
construction can be carried out in NL. Since the emptiness problem of finite-state Biichi 
automata is in NL and the universality problem is in P SPACE, we immediately get that 
the desired upper bounds. 

(Lower Bounds). Please note that the NL-hardness of the emptiness problem can 
be proved easily from the emptiness problem of deterministic finite state machines. For the 
universality problem, we make the following claim. 

Claim: Given an FPM M such that the A4 is also a hierarchical PBA, the problem of 
deciding whether C=\{M) is empty is PSPACE-hard. 

Before, we proceed to prove the claim, we first show how the lower bound follows from 
the reduction. Given an FPM M. = (Q,q s ,Qf,6) with reject state q r , consider the PBA 
•M = (Q,q s ,{Qr}iO~) obtained by taking the reject state of M. as the unique final state of 
M. Clearly, 

(1) M isJIPBA if M is. 

(2) £>o(A / () is universal iff C=\{M) is empty. 

From these two observations the desired result will follow if we can prove the claim. We 
now prove the claim. 

Proof of the claim. We show that there is a polynomial time bounded reduction from 
every language in P SPACE to the language 

{(M, S) | M is an FPM on S, M is a HPBA and C=x{M) = 0}. 

Consider a language L € PSPACE and T be a single tape deterministic Turing machine 
that accepts L in space p(n) for some polynomial p where n is the length of its input. We 
assume that T accepts an input by halting in a specific final state qj and T rejects an input 
by not halting. Let T be given by the tuple (Q, A, T, A, 90, If)- Here Q is the set of states 
of the finite control of T; A, T are the input and tape alphabets and ACT and the blank 
symbol #isinr\A; A:Qxr^TxQx {Left, Right}; 90 is the initial state and qf is 
the final state. Each tuple A(g, a) = (a', 9', d) indicates that when T is in state 9, scanning 
a cell containing the symbol a, then T writes value a' in the current cell, changes to state 
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q' and moves in the direction d. Without loss of generality, we assume that head position 
of T initially is at cell number 0. 

Let <3?' = r x Q and <I> = $' U T. We call members of composite symbols. A 
configuration of T, on an input of length n, is a string of symbols, of length p(n), drawn 
from <!>. We can define a valid configuration in the standard way. In each valid configuration 
there can be only one composite symbol (i.e., from and that indicates the head position 
of T. A computation of T is a sequence of configurations which is either finite or infinite 
depending on whether the input is accepted or not. A computation starts in an initial 
configuration and each succeeding configuration is obtained by one move of T from the 
previous configuration. The initial configuration contains the input string and the first 
symbol in it is from T x Q indicating its head position is on the first cell. 

For given input a, we construct a FPM M. a such that M. a is a 2-HPBA and M. a accepts 
some infinite input with probability 1 iff T rejects a, i.e., T does not halt on a. Let a be an 
input to T of length n and let m = p(n). A state of the automaton A4 a is a pair of the form 
{i, s) where < i < m and s G <3>, or is in {q s , q r }; here q s is the initial state and is of rank 
and q r is the reject state and is of rank 2. The rank of states {(i, s) | < i < m and s£f} 
will be 1. Intuitively, if A4 a is in state (i,s) that denotes that i th element of the current 
configuration of the computation of T has value s. Note that s is in <£' or is in T. The input 
alphabet to M a is the set {0, ...,m — 1} x <3?' x {left, right} together with an additional 
input symbol r; that is each input to the automaton is r or is of the form (i, (b,q),d). 

Let a = o"0) ■■•)Cn-i be the input to T. The transitions of A4 a are defined as follows. 
From the initial state q s , on input r, there are transitions to the states {i,r,j), for each 
i € {0, ...,m — 1} where, rj = (ao,Qo) an d n = for < i < n, and is the blank symbol 
otherwise; the probability of each of these transitions is ^. Thus the input r sets up the 
initial configuration when A4 a is in the initial state q r . From every other state on input r 
there is a transition to the reject state q r with probability 1. Also, from the initial state 
q s , there is a transition to the reject state q r with probability 1 for all input symbols other 
than r. 

From any state of the form (J, (b,q)) on input of the form (i, (a,q'),d) the transition 
is defined as follows: if i = j, q = q', b = a and A(q,a) = (qi,a\,d), then there is a 
transition to the automaton state (j,a±); otherwise, the transition is to q r ; in either case, 
the probability of the transition is 1 . Note that if T halts then also there is a transition to 
q r . 

From any state of the form (j, 6), where b € T, on input symbol of the form (i, (a, q), d) 
the transitions are defined as follows: if either i = j — 1, d = right and A(q, a) = (q 1 , a 1 , d 1 ), 
or if i = j + 1, d = left and A(q, a) = (q', a', d') then the transition is to the state (j, (b, q')); 
otherwise the transition is back to (j, b) ; in both cases the probability of the transition is 1 . 

Suppose a is rejected, i.e., T does not terminate on a. Furthermore assume that 
the composite symbols in each successive configuration of the infinite computation of T 
on input a are (cto, qo), (a-i, Qi), ■■■ and they occur in positions io,... and the direction of 
the head movement is given by do,--- respectively. Then M a accepts the infinite string 
t{iq, (ao, qo), do), .., (i^, c/fc), d^), ■■■ with probability 1 and accepts all others with prob- 
ability less than 1. It is not difficult to see that if a is accepted by T, all input strings are 
accepted by M a with probability less than 1. The above reduction is clearly polynomial 
time bounded. (End proof of the claim.) □ 
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5.2. Almost-sure semantics. For a hierarchical PBA, the "partial" complementation op- 
eration for almost-sure semantics discussed in Section [5] yields a hierarchical PBA. There- 
fore using Theorem 15.61 we immediately get that a language C € L(HPBA =1 ) is ^-regular. 
Thanks to the topological characterization of L(HPBA =1 ) as a sub-collection of determin- 
istic languages, we get that L(HPBA =1 ) is exactly the class of languages recognized by 
deterministic finite-state Biichi automata. 

Theorem 5.8. L(HPBA =1 ) = Regular n Deterministic. 

Proof. The inclusion Regular n Deterministic C L(HPBA =1 ) follows immediately from the 
fact that any language in Regular n Deterministic is recognizable by a finite-state determin- 
istic Biichi automaton. For the reverse inclusion L(HPBA =1 ) C Regular n Deterministic, 
note that since L(PBA~ 1 ) C Deterministic, it suffices to show that L(HPBA _1 ) C Regular. 
Now, given L € L(HPBA =1 ), Lemma [4 . 2 1 immediately implies that there is an FPM At such 
that £>o(At) = S w \ L. Furthermore, it is easy to see from the proof of Lemma [4.21 that we 
can take At to be hierarchical given that L G L(HPBA _1 ). Now, thanks to Theorem ESI 
£>o(At) is w-regular which implies that L is also w-regular. □ 

The "partial" complementation operation also yields the complexity of emptiness and 
universality problems. 

Theorem 5.9. Given a RatHPBA, B, the problem of deciding whether C=\{B) = is 
PSPACE-complete. The problem of deciding whether C=\{B) = E w is NL-complete. 

(Upper Bounds.) The upper bounds are obtained by constructing the FPM At such 
that C=i(B) = S w \ £>o(At) as in the proof of Lemma 14.21 Now, At is hierarchical if B is 
hierarchical. The result now follows immediately from Theorem 15.71 

Proof. (Lower Bounds.) The NL-hardness of checking universality can be shown from 
NL-hardness of checking emptiness of deterministic finite state machines. Please recall that 
in the proof of Theorem [521 we had shown that given an FPM At such that At is a HPBA, 
the problem of checking whether £=i(At) is empty is PSPACE-hard. Thus, it follows 
immediately that checking emptiness of C=\{B) for a HPBA is PSPACE-hard. Q 

6. Conclusions 

In this paper, we investigated the power of randomization in finite state automata on infinite 
strings. We presented a number of results on the expressiveness and decidability problems 
under different notions of acceptance based on the probability of acceptance. In the case 
of decidability, we gave tight bounds for both the universality and emptiness problems. 
As part of future work, it will be interesting to investigate the power of randomization in 
other models of computations on infinite strings such as pushdown automata etc. Since 
the universality and emptiness problems are PSPACE-complete for almost-sure semantics, 
their application to practical systems needs further enquiry. 
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Appendix A. Properties of K in the proof of Lemma 13.81 

Lemma A.l. Let M be a deterministic 2-counter machine with a one way read only input 
tape whose configurations are encoded over alphabet X'. Let e be any rational such that 
< e < \. There is a PFA TZ over alphabet = £' U {@}, where @ such that 
(1) There exists an (computable) integer constant d > 2 such that if w is a valid and 

halting computation of M of length n, then the input string (w@) dn is accepted by TZ 

with probability > (1 — e), and 
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(2) Any input x = w\@W2@ ■ ■ ■ @w m @, where no Wi is a valid halting computation of M, 
is accepted by 1Z with probability at most e. 

Proof. We begin by recalling some details of the construction given in [CL89j . Given M 
and eo, ICL89) 4 ! give the construction of a PFA 1Z (eo) which expects the input to be of the 
form x = wi@W2@ ■ ■ ■ w m @, where Wi € (£')*. The automaton lZ M (eo) tries to check that 
each Wi is a valid, halting computation of M. Since TZ M (eo) has only finitely many states, it 
cannot reliably check consistency as it requires maintaining counter values. Instead TZ M (eo) 
plays a game on reading a computation Wi, wherein it tosses 0(n) coins; here n is \wi\. The 
game has four possible outcomes. 

(1) reject, when 1Z M (eo) discovers an error in Wi, 

(2) double wins, 

(3) sum wins, or 

(4) neither wins 

Details of how this game is played are beyond the scope of this paper, and the interested 
reader is referred to [CL89J. If Wi is a valid halting computation, the following properties are 
known to hold: (a) reject is never an outcome of the game, (b) the probability of outcome 
double wins is equal to the probability of the outcome single wins, which we will denote 
by p in this proof, and (c) p > 2 _4n . The automaton 1Z M (eo) maintains two counters D 
and S that take values between and q — q is a constant integer whose value will be fixed 
later in the next paragraph. After playing the game on Wi, 1Z M (eo) takes the following 
steps depending on the outcome of the game. If the outcome is reject, then 1Z M (eo) moves 
to a special reject state q r and ignores the rest of the input. If the outcome is double 
wins then counter D is incremented, and if the outcome is sum wins then counter S is 
incremented. When the outcome is neither wins, the counters D and S are left unchanged. 
The automaton 1Z M (eo) then checks the value of D and S — if either of them are q then it 
ignores the rest of the input and does not play anymore games; on the other hand if both 
S and D are less than q then it processes Wi + \ by playing the game. 

After processing the entire input x = wi@W2@ ■ ■ ■ w m @, the automaton TZ M (eo) decides 
to accept or reject X ELS follows. 

(1) If 1Z M (eo) is in the reject state q r (i.e., one of the games played had outcome reject) 
then x is rejected. 

(2) x is also rejected if either (a) both S and D are < q, or (b) D = q and S = 0. 

(3) In all other cases, x is accepted, i.e., when S = q or when D = q and S ^ 0. 

The constant q is fixed to ensure that the following property holds: Assuming that after 
processing x at least one of the counters D or S is q, the probability that x is accepted is 
(a) > 1 — eo if all the Wis are valid, halting computations of M, and (b) < eo if none of the 
u>jS are valid, halting computations. 

In proving this lemma, we will take 7Z to be the PFA obtained by taking eo = |, i.e., 
1Z = 1Z M (^)- We will first show that any input x = w\@W2@ ■ ■ ■ @w m @, where no Wi is 
a valid halting computation of M, is accepted by 1Z with probability at most e. Now, we 
know by construction of 1Z and properties stated above, assuming that one of S or D is q, 
the probability that x is accepted is at most |. Moreover, when both S and D are < q, we 
know (by construction) that 7Z rejects x. Thus the second condition in the lemma holds. 



The construction in |CL89| is actually carried out only for deterministic 2-counter machines without an 
input tape. However, the construction easily carries over to deterministic 2-counter machines with one-way 
read only input tape. 
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We will now prove the first condition. Consider input x = wi@W2@ ■ ■ ■ @w m @, where 
all the Wis are valid, halting computations of M of (equal) length n. Taking psd to be the 
probability that either D or S is q after processing x, the probability that x is accepted by 
1Z is at least psr>(l — §)• Thus, to prove the first condition, all we need to show is that if 
m is larger than d n , for some fixed, computable d, then psr>(l — |) > 1 — e. In other words, 
we need to prove that for such large m, psd > 5, where S = (1 — e)/(l — |). 

Let psd = 1 — Psd- So psd is the probability that both S and D are less than q 
after x is processed. Recall that p is the probability that D is incremented after a single 
computation Wi is processed by 1Z. Moreover, since all the Wis are assumed to be valid 
computations, p is also the probability that S is incremented after playing one game. Thus, 
we can say that 

E E ( s 7d) ( s+ s D ) » s+d ^ ~ W"- ,S+D) 

0<S<gO<D<g v 7 v 7 

where ( \ J is the number of ways of choosing t objects from k objects. We can simplify 



the above expression as follows. 

m \ (S + D 

,0<S<q^0<D<q I g + D J I g 



Eo<s<qE <D<q ( Q Tn) (°;^)p S+D (l-2pr- (5+D) 



= hlf) Z < k <2q-2 (fc) (2p)*(l " 2?)™"* 

< ( 2 ;_" 2 ) Eo< fe < 29 - 2 (7) (2p) fc (l - 2p) m ~ k 



In the above reasoning, the second line follows from the observation that since S+D < 2q—2, 
^ g ^ < f ^ ^ , and the last line follows from the fact that mm (k,q-i) < 1 _ Also 

observe that Ylo<k<2q-2 (^j^J (2p) fe (l — 2p) m ~ k is nothing but the cumulative distribution 

function for a binomial distribution with parameters 2p and m. Taking m such that 2q — 2 < 
m(2p), we upper bound the above expression using Chernoff bounds as follows, 

2q - 2\ ^ (m\ (n , kn , m _ fc ^ /2 9 - 2\ , (2pm - (2g - 2)) 2 



^ s ( 2 ;:f) e (?) - £ ( 2 ? Ei) »p<- 



0<fe<2(j-2 



4pm 



Let p = ^ 2 ^_ ^ • Now p • exp(- ^ ym 4 ^ 2 ^ ) < 1 — 5 when m > 2 \ where 

9 = log(y^). Finally since p > 2~ 4n , we get the desired d for the lemma. □ 
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